Overview

Dataset Statistics

Number of Variables 14
Number of Rows 3150
Missing Cells 0
Missing Cells (%) 0.0%
Duplicate Rows 300
Duplicate Rows (%) 9.5%
Total Size in Memory 344.7 KB
Average Row Size in Memory 112.0 B
Variable Types
  • Numerical: 8
  • Categorical: 6

Dataset Insights

Call Failure is skewed
Charge Amount is skewed
Frequency of SMS is skewed
Customer Value is skewed
Dataset has 300 (9.52%) duplicate rows
Complains has constant length 1
Age Group has constant length 1
Tariff Plan has constant length 1
Status has constant length 1
Age has constant length 2
Churn has constant length 1
Call Failure has 702 (22.29%) zeros
Charge Amount has 1768 (56.13%) zeros
Frequency of SMS has 603 (19.14%) zeros
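The percentages quoted in the insights follow directly from the reported counts and the 3150 rows. A minimal sketch that recomputes them, using only figures from this report (the helper name `pct` is illustrative, not part of any profiling tool):

```python
N_ROWS = 3150  # "Number of Rows" from Dataset Statistics


def pct(count, total=N_ROWS):
    """Share of `count` in `total`, as a percentage rounded to two decimals."""
    return round(100 * count / total, 2)


# Counts copied from the Dataset Insights list.
reported = {
    "duplicate rows": 300,
    "Call Failure zeros": 702,
    "Charge Amount zeros": 1768,
    "Frequency of SMS zeros": 603,
}

for name, count in reported.items():
    print(f"{name}: {count} ({pct(count)}%)")
```

The per-variable sections show the same shares rounded to one decimal (for example 22.3% for Call Failure zeros).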

Variables


Call Failure

numerical

Approximate Distinct Count 37
Approximate Unique (%) 1.2%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 50400
Mean 7.6279
Minimum 0
Maximum 36
Zeros 702
Zeros (%) 22.3%
Negatives 0
Negatives (%) 0.0%
  • Call Failure is skewed right (γ1 = 1.0892)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 1
Median 6
Q3 12
95-th Percentile 22
Maximum 36
Range 36
IQR 11

Descriptive Statistics

Mean 7.6279
Standard Deviation 7.2639
Variance 52.764
Sum 24028
Skewness 1.0892
Kurtosis 0.9035
Coefficient of Variation 0.9523
  • Call Failure is not normally distributed (p-value 2.667287018321437e-17)
  • Call Failure has 47 outliers
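Several of the figures for Call Failure are derived from others in the same section. A small sketch, using only numbers copied from this report, that checks the relationships (variance = standard deviation squared, coefficient of variation = standard deviation / mean, IQR = Q3 - Q1, range = maximum - minimum):

```python
# Figures copied from the Call Failure section of this report.
mean, std = 7.6279, 7.2639
q1, q3 = 1, 12
minimum, maximum = 0, 36

variance = std ** 2               # reported: 52.764
coeff_var = std / mean            # reported: 0.9523
iqr = q3 - q1                     # reported: 11
value_range = maximum - minimum   # reported: 36

print(round(variance, 3), round(coeff_var, 4), iqr, value_range)
```

Small differences in the last digit are expected because the inputs are themselves rounded to four decimals.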

Complains

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 207900
  • The largest value (0) is over 12.07 times larger than the second largest value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 0
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 3150
  • The top 2 categories (0, 1) take over 50.0%
  • Complains has words of constant length

Subscription Length

numerical

Approximate Distinct Count 45
Approximate Unique (%) 1.4%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 50400
Mean 32.5419
Minimum 3
Maximum 47
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Subscription Length is skewed left (γ1 = -1.2994)

Quantile Statistics

Minimum 3
5-th Percentile 13
Q1 30
Median 35
Q3 38
95-th Percentile 42
Maximum 47
Range 44
IQR 8

Descriptive Statistics

Mean 32.5419
Standard Deviation 8.5735
Variance 73.5046
Sum 102507
Skewness -1.2994
Kurtosis 1.212
Coefficient of Variation 0.2635
  • Subscription Length is not normally distributed (p-value 0.00010545352373785397)
  • Subscription Length has 282 outliers
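The report does not say how outliers are counted. A minimal sketch, assuming the common 1.5 × IQR (Tukey) fence rule, shows where the cut-offs would fall for Subscription Length. The function name `iqr_fences` and the multiplier `k` are illustrative assumptions, not taken from the report.

```python
def iqr_fences(q1, q3, k=1.5):
    """Return (lower, upper) fences; values outside them count as outliers.

    Assumes the Tukey rule with multiplier k; the report does not confirm
    which rule it uses.
    """
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr


# Q1 and Q3 copied from the Subscription Length quantile statistics.
lower, upper = iqr_fences(30, 38)
print(lower, upper)
```

Under this assumption the upper fence lies above the reported maximum of 47, so any flagged values would sit on the low side, which is consistent with the left skew reported for this variable.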

Charge Amount

numerical

Approximate Distinct Count 11
Approximate Unique (%) 0.3%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 50400
Mean 0.9429
Minimum 0
Maximum 10
Zeros 1768
Zeros (%) 56.1%
Negatives 0
Negatives (%) 0.0%
  • Charge Amount is skewed right (γ1 = 2.5836)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 0
Q3 1
95-th Percentile 4
Maximum 10
Range 10
IQR 1

Descriptive Statistics

Mean 0.9429
Standard Deviation 1.5211
Variance 2.3137
Sum 2970
Skewness 2.5836
Kurtosis 8.8384
Coefficient of Variation 1.6133
  • Charge Amount is not normally distributed (p-value 9.62129555836803e-22)
  • Charge Amount has 370 outliers
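The γ1 value reported for each numerical variable is a skewness coefficient: positive for a long right tail, negative for a long left tail. A minimal sketch, assuming the population (biased) moment definition g1 = m3 / m2^(3/2); the report does not state which estimator it uses, and the sample lists below are made-up illustrations, not data from this dataset.

```python
def skewness(values):
    """Population skewness g1 = m3 / m2**1.5 (assumed definition)."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((x - mean) ** 2 for x in values) / n  # second central moment
    m3 = sum((x - mean) ** 3 for x in values) / n  # third central moment
    return m3 / m2 ** 1.5


# Hypothetical toy samples for illustration only.
symmetric = [1, 2, 3, 4, 5]
right_tailed = [0, 0, 0, 0, 1, 1, 2, 10]   # many zeros, a few large values
left_tailed = [-x for x in right_tailed]   # mirror image

print(skewness(symmetric), skewness(right_tailed), skewness(left_tailed))
```

A column like Charge Amount, where more than half the values are zero and a few reach 10, has the same shape as the right-tailed toy sample.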

Seconds of Use

numerical

Approximate Distinct Count 1756
Approximate Unique (%) 55.8%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 50400
Mean 4472.4597
Minimum 0
Maximum 17090
Zeros 154
Zeros (%) 4.9%
Negatives 0
Negatives (%) 0.0%
  • Seconds of Use is skewed right (γ1 = 1.3213)

Quantile Statistics

Minimum 0
5-th Percentile 54.5
Q1 1391.25
Median 2990
Q3 6478.25
95-th Percentile 15020.5
Maximum 17090
Range 17090
IQR 5087

Descriptive Statistics

Mean 4472.4597
Standard Deviation 4197.9087
Variance 1.7622e+07
Sum 1.4088e+07
Skewness 1.3213
Kurtosis 0.9902
Coefficient of Variation 0.9386
  • Seconds of Use is not normally distributed (p-value 0.0035984119307757717)
  • Seconds of Use has 200 outliers

Frequency of use

numerical

Approximate Distinct Count 242
Approximate Unique (%) 7.7%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 50400
Mean 69.4606
Minimum 0
Maximum 255
Zeros 154
Zeros (%) 4.9%
Negatives 0
Negatives (%) 0.0%
  • Frequency of use is skewed right (γ1 = 1.1436)

Quantile Statistics

Minimum 0
5-th Percentile 1
Q1 27
Median 54
Q3 95
95-th Percentile 184.55
Maximum 255
Range 255
IQR 68

Descriptive Statistics

Mean 69.4606
Standard Deviation 57.4133
Variance 3296.2879
Sum 218801
Skewness 1.1436
Kurtosis 0.8169
Coefficient of Variation 0.8266
  • Frequency of use has 129 outliers
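Fractional percentiles on an integer-valued column, such as the 95th percentile of 184.55 above, suggest that the quantiles are interpolated between neighbouring sorted values. A minimal sketch, assuming linear interpolation between order statistics (the report does not state its method); the function name `percentile` and the sample data are illustrative.

```python
import math


def percentile(values, p):
    """p-th percentile (0-100) using linear interpolation between ranks."""
    ordered = sorted(values)
    rank = (len(ordered) - 1) * p / 100   # fractional index into sorted data
    lo, hi = math.floor(rank), math.ceil(rank)
    return ordered[lo] + (ordered[hi] - ordered[lo]) * (rank - lo)


# Hypothetical integer sample: the result can still be fractional.
sample = [10, 20, 30, 40, 50]
print(percentile(sample, 50), percentile(sample, 95))
```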

Frequency of SMS

numerical

Approximate Distinct Count 405
Approximate Unique (%) 12.9%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 50400
Mean 73.1749
Minimum 0
Maximum 522
Zeros 603
Zeros (%) 19.1%
Negatives 0
Negatives (%) 0.0%
  • Frequency of SMS is skewed right (γ1 = 1.9732)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 6
Median 21
Q3 87
95-th Percentile 356.55
Maximum 522
Range 522
IQR 81

Descriptive Statistics

Mean 73.1749
Standard Deviation 112.2376
Variance 12597.2698
Sum 230501
Skewness 1.9732
Kurtosis 3.2515
Coefficient of Variation 1.5338
  • Frequency of SMS is not normally distributed (p-value 5.951521882870113e-21)
  • Frequency of SMS has 368 outliers

Distinct Called Numbers

numerical

Approximate Distinct Count 92
Approximate Unique (%) 2.9%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 50400
Mean 23.5098
Minimum 0
Maximum 97
Zeros 154
Zeros (%) 4.9%
Negatives 0
Negatives (%) 0.0%
  • Distinct Called Numbers is skewed right (γ1 = 1.0289)

Quantile Statistics

Minimum 0
5-th Percentile 1
Q1 10
Median 21
Q3 34
95-th Percentile 51
Maximum 97
Range 97
IQR 24

Descriptive Statistics

Mean 23.5098
Standard Deviation 17.2173
Variance 296.4367
Sum 74056
Skewness 1.0289
Kurtosis 1.3559
Coefficient of Variation 0.7323
  • Distinct Called Numbers is not normally distributed (p-value 4.372702717747554e-10)
  • Distinct Called Numbers has 77 outliers

Age Group

categorical

Approximate Distinct Count 5
Approximate Unique (%) 0.2%
Missing 0
Missing (%) 0.0%
Memory Size 207900

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 3
2nd row 2
3rd row 3
4th row 1
5th row 1

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 3150
  • The top 2 categories (3, 2) take over 50.0%
  • Age Group has words of constant length

Tariff Plan

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 207900
  • The largest value (1) is over 11.86 times larger than the second largest value (2)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 1
2nd row 1
3rd row 1
4th row 1
5th row 1

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 3150
  • The top 2 categories (1, 2) take over 50.0%
  • Tariff Plan has words of constant length

Status

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 207900
  • The largest value (1) is over 3.03 times larger than the second largest value (2)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 1
2nd row 2
3rd row 1
4th row 1
5th row 1

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 3150
  • The top 2 categories (1, 2) take over 50.0%
  • Status has words of constant length

Age

categorical

Approximate Distinct Count 5
Approximate Unique (%) 0.2%
Missing 0
Missing (%) 0.0%
Memory Size 211050

Length

Mean 2
Standard Deviation 0
Median 2
Minimum 2
Maximum 2

Sample

1st row 30
2nd row 25
3rd row 30
4th row 15
5th row 15

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 6300
  • The top 2 categories (30, 25) take over 50.0%
  • Age has words of constant length

Customer Value

numerical

Approximate Distinct Count 2654
Approximate Unique (%) 84.2%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 50400
Mean 470.9729
Minimum 0
Maximum 2165.28
Zeros 132
Zeros (%) 4.2%
Negatives 0
Negatives (%) 0.0%
  • Customer Value is skewed right (γ1 = 1.4266)

Quantile Statistics

Minimum 0
5-th Percentile 10.335
Q1 113.8012
Median 228.48
Q3 788.3888
95-th Percentile 1587.68
Maximum 2165.28
Range 2165.28
IQR 674.5875

Descriptive Statistics

Mean 470.9729
Standard Deviation 517.0154
Variance 267304.9578
Sum 1.4836e+06
Skewness 1.4266
Kurtosis 1.2206
Coefficient of Variation 1.0978
  • Customer Value is not normally distributed (p-value 2.372575129632763e-07)
  • Customer Value has 116 outliers

Churn

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 207900
  • The largest value (0) is over 5.36 times larger than the second largest value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 0
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 3150
  • The top 2 categories (0, 1) take over 50.0%
  • Churn has words of constant length
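The report gives the imbalance of the two-valued variables only as a ratio between the largest and second largest category, not as counts. A rough sketch of how approximate counts could be recovered from that ratio and the 3150 rows. These are estimates, not figures from the report, because the ratio is shown with only two decimals; the function name `estimate_counts` is illustrative.

```python
def estimate_counts(total, ratio):
    """Estimate (majority, minority) counts for a two-category variable.

    If majority = ratio * minority and majority + minority = total,
    then minority = total / (1 + ratio).
    """
    minor = round(total / (1 + ratio))
    return total - minor, minor


# Ratios copied from the Churn and Complains sections.
for name, ratio in [("Churn", 5.36), ("Complains", 12.07)]:
    major, minor = estimate_counts(3150, ratio)
    print(f"{name}: about {major} vs {minor} ({100 * minor / 3150:.1f}% minority)")
```

An imbalance of this size in Churn matters for any downstream classifier: plain accuracy would look high even for a model that always predicts the majority class.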

Interactions

Correlations

Missing Values